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Abstract 

The present study reports the large-scale discovery of genome-wide single-nucleotide polymorphisms 
(SNPs) in chickpea, identified mainly through the next generation sequencing of two genotypes, i.e. 
Cicer arietinum ICC4958 and its wild progenitor C. reticulatum P1489777, parents of an inter-specific 
reference mapping population of chickpea. Development and validation of a high-throughput SNP geno- 
typing assay based on lllumina's GoIdenGate Genotyping Technology and its application in building a 
high-resolution genetic linkage map of chickpea is described for the first time. In this study, 1022 SNPs 
were identified, of which 768 high-confidence SNPs were selected for designing the custom Oligo Pool 
All (CpOPA-l) for genotyping. Of these, 697 SNPs could be successfully used for genotyping, demonstrating 
a high success rate of 90.75%. Genotyping data of the 697 SNPs were compiled along with those of 368 
co-dominant markers mapped in an earlier study, and a saturated genetic linkage map of chickpea was 
constructed. One thousand and sixty-three markers were mapped onto eight linkage groups spanning 
1808.7 cM (centiMorgans) with an average inter-marker distance of 1.70 cM, thereby representing one 
of the most advanced maps of chickpea. The map was used for the synteny analysis of chickpea, which 
revealed a higher degree of synteny with the phylogenetically close Medicago than with soybean. The 
first set of validated SNPs and map resources developed in this study will not only facilitate QTL 
mapping, genome-wide association analysis and comparative mapping in legumes but also help anchor 
scaffolds arising out of the whole-genome sequencing of chickpea. 
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1 . Introduction 

Discovery of the DNA sequence polymorphism is a 
prerequisite to generating genetic markers for various 
applications of modern genomics research such as 
map-based cloning, marker-assisted breeding, associ- 
ation mapping and understanding genome structure 
and function. Among the sequence polymorphisms, 
single-nucleotide polymorphisms (SNPs) represent 
the most abundant type of variation present in DNA. 
SNPs are mostly biallelic, 1 co-dominantly inherited, 



sequence tagged and occur at high density within 
genomes. 2 They are thus amenable to the development 
of genetic molecular markers at low cost, which can 
provide sufficiently dense genome coverage for the 
dissection of complex traits. SNPs are known to occur 
at high frequencies of ~1 per 500-1 000 bp in 
humans 3,4 and in plant genomes also, where their fre- 
quency appears to vary significantly, e.g. 1 SNP per 
1 6 bp in eucalyptus, 5 1 per 1 07 bp in radish, 6 1 per 
147 bp in rice, 7 1 per 200 bp in maize, 8 1 per 3 70 bp 
in soybean genes 9 and 1 per 500 bp \nArabidopsis}° 
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Recently, technological advancements have acceler- 
ated the genome-wide SNP discovery not only in 
model species, but also in crop plants. Initial efforts 
relied on the discovery of large numbers of SNPs from 
EST databases such as in grapevine, 11 spruce, 12 
cowpea, 13 pea, 14 wheat 15 and pine. 16 However, more 
recently, with the development of the next generation 
sequencing (NGS) platforms which enable the sequen- 
cing of millions of bases at deep coverage, thousands of 
SNPs have been identified in many species such as 
maize, 17 soybean, 18 Medicago,^ 9 Eucalyptus, 20 rice 21 
and sunflower. 22 To utilize the millions of available 
SNPs, various high-throughput SNP genotyping plat- 
forms were simultaneously developed such as the 
GoldenGate Genotyping Technology (GGGT; lllumina, 
San Diego, CA, USA), 23 BeadChip-based Infinium 
assay (lllumina), 24,25 SNPStream (Beckman Coulter, 
USA), 26 GeneChip (Affymetrix, USA) 27,28 and competi- 
tive allele-specific PCR, KASPar (KBio science, UK) 29 
that allow large-scale genotyping of SNPs in parallel in 
a large set of individuals. 30 These approaches vary in 
terms of sensitivity, reproducibility, accuracy, capability 
of multiplexing and throughput. Among them, one of 
the most versatile SNP genotyping platforms is the 
lllumina GGGT, which is capable of multiplexing from 
96 to 1 536 SNPs per assay in a single reaction over a 
3-day period. 23 This technology has persistently been 
reported as highly reliable, with high SNP conversion 
rates and has fostered genetic research in several 
major as well as in orphan plants, especially self-pollin- 
ating crops that have been plagued by a narrow genetic 
base with nucleotidediversitiesfrom 0.2 to fJ.5%, 1 3,31 " 
33 as well asthe cross- pollinated species with higherse- 
quence diversities of >2%? 2 > 34 > 35 High-throughput 
SNP genotyping has enabled various applications in 
plant genomics especially genome-wide association 
studies and linkage disequilibrium studies, 1 0,36-38 
synteny-based comparative genomics 1 3,39 and high- 
resolution genetic mapping. 40,41 A high-density genetic 
linkage map is one of the most important genomic 
tools to accelerate marker-assisted breeding. Recent 
studies have reported the successful utilization of high- 
throughput GGGT for SNP genotyping to build genetic 
linkage maps in legumes such as cowpea, 13 soy- 
bean 18,31 and pea, 14 and even in plants with highly re- 
petitive or polyploid genomes like barley, 32 pine, 34 
grass 41 and maize. 42 Production of high-density SNP- 
based maps have facilitated the fine mapping and 
cloning of agronomically important genes and also 
anchoring and orienting the scaffolds generated by 
whole genome sequence assembly data. 1 8,41 ,43 

Chickpea (C. arietinum L), a diploid (2n = 2x = 1 6), 
annual, self-pollinated crop, with a genome size of 
740 Mb, 44 representstheworld'sthird most important 
legume crop that is mainly grown in the arid and 
semi-arid regions of Asia and Africa. It serves as a key 



source of protein in human nutrition and also plays 
an important role in the maintenance of soil fertility 
owing to its ability to fix atmospheric nitrogen. Due 
to its economic importance, there has been a recent 
spurt in chickpea genomics research and a large 
number of genomic resources, such as molecular 
markers and linkage maps, 45-48 ESTs 49-52 and NGS- 
based transcriptomes, 53-55 have become available. 
Currently, there are ~2 000 co-dominant molecular 
markers available, which include ~980 genomic 
simple sequence repeats (gSSRs), 45,46,56 " 62 and EST- 
derived markers including 361 EST-SSRs, 238 Intron- 
Targeted Primers (ITPs), 1 09 Expressed Sequence Tag 
Polymorphisms (ESTPs) and 294 Cleaved Amplified 
Polymorphic Sites/SNPs. 45,47 " 49,51,63 Using these 
co-dominant markers, linkage maps that have been 
generated define ~500 mapped positions. 45-48,58,64 
Moreover, from the recent chickpea transcriptome se- 
quencing, even though ~5000 molecular markers 
have been reported, 54,55 the validation and genotyp- 
ing of these molecular markers is yet to be undertaken. 
Despite this increased availability of genomic 
resources, the large-scale discovery and utilization of 
SNPs, which serve as the most potential markers for 
providing sufficiently dense genetic maps, has not 
been carried out in chickpea. Therefore, in the 
present study, the large-scale identification of 
genome-wide SNPs was undertaken from the NGS of 
two parents of a mapping population (C. arietinum cv. 
ICC4958 and C. reticulatum PI489777). Conversion 
of SNPs into successful genotyping assays based on 
the lllumina GoldenGate technology was demon- 
strated for the first time in chickpea. Moreover, the 
SNP markers were successfully mapped in the backdrop 
of previously reported co-dominant markers to gener- 
ate one of the most comprehensive and dense genetic 
maps of chickpea. 

2. Materials and methods 

2. /. Plant material and DNA isolation 

For SNP discovery, the two genotypes C. arietinum 
ICC4958 (fusarium wilt resistant and drought toler- 
ant) and C. reticulatum PI4 897 77 (wild annual 
species, fusarium wilt susceptible), which are parents 
of a mapping population, were sequenced. Nuclear 
DNA from these two genotypes was isolated using 
the protocol of Malmberg et al. 65 and used for se- 
quencing. For SNP genotyping and linkage map gener- 
ation, the internationally accepted reference mapping 
population consisting of 129 RILs (recombinant 
inbred lines) arising from an inter-specific cross 
between the two genotypes mentioned above was uti- 
lized. Genomic DNA was isolated from fresh young 
leaves of the two mapping parents and the 1 29 RILs 



No. 5] 



R. Gaur et al. 



359 



using the GenElute™ plant genomic DNA Miniprep 
kit (Sigma). The DNA quality was checked by electro- 
phoresis on 0.8% agarose gels. For SNP genotyping, 
the DNA was quantified using Quant-iT™ Pico 
Green® dsDNA Kit (Invitrogen) and the fluorescence 
was measured with the Microtiter plate reader 
(Varioscan from Thermo Scientific). Samples were 
adjusted to 50 ng/|xl using Tris-EDTA buffer. The 
GoldenGate genotyping assay was performed using 
2 50 ng of DNA from each of the RILs. 

2.2. NGS and SNP discovery 

SNP discovery was based on the high-throughput 
next generation whole genome sequencing of the 
two genotypes, i.e. C. arietinum cv. ICC49 58 and its 
wild relative C. reticulatum cv. PI489777. Sequencing 
of C. arietinum ICC49 58 was carried out primarily 
by the 454/Roche GS FLX Titanium platform, for 
which DNA sample preparation and construction of 
whole-genome shotgun (WGS) and matepair libraries 
were performed as described by the manufacturer, 66 
with some modifications. Briefly, the nuclear DNA 
was nebulized and size-selected for a 300-900-bp 
fragment size using Agencourt AMPure SPRI beads 
and ligated with specific adapters A and B at each 
end, which resulted in generating a single-stranded 
or a double-stranded library. One single-stranded 
library using the standard library preparation kit and 
two double-stranded libraries using a rapid library 
preparation kit with an average insert size of 705 
bases were used for sequencing. The libraries were 
amplified by emPCR. 66 DNA molecules that contain 
a single A and B adapters at each end were enriched 
with capture beads coated with capture DNA. 
Sequencing was performed using GS FLX Titanium 
Sequencing kits following the manufacturer's protocol 
(Roche Applied Sciences, Mannheim, Germany). The 
generated Roche 454 reads were submitted to the 
Short Read Archive (SRA) database of NCBI (www. 
ncbi.nlm.nih.gov/sra) and assigned the accession 
number SRA053228.1. These reads were assembled 
using the de novo assembly tool Newbler (GS de novo 
assembler, Roche Applied Sciences) version 2.3 to 
obtain the reference sequence. Further, two genomic 
DNA libraries of 3 kb average insert size, of both the 
parents (C. arietinum ICC4958 and C. reticulatum 
PI489777) were also prepared by the SOLiD Opti 
Mate-paired library kit following the manufacturer's 
protocol and sequenced using the SOLiD 4.0 platform 
(Applied Biosystems, Foster City, CA, USA). These reads 
were assigned the accession number SRA053197.5 
by the SRA of NCBI. SNPs were identified by aligning 
and mapping the SOLiD reads of ICC4958 and 
PI489777 onto the Roche assembly using the tool 
HAPS (http://solidsoftwaretools.com/gf/project/haps/). 



For the selection of SNPs the following criteria were 
used: (i) non-reference alleles with a minimum 
throughput of 5X and score 0.6 were selected, (ii) the 
non-reference alleles with any variant (SNPs or 
indels) within 60 bases upstream or downstream of 
the consensus reference sequences were discarded 
and (iii) one SNP from each contig was chosen to 
ensure genome-wide distribution. 

The second approach for SNP discovery was ampli- 
con resequencing. This involved the identification of 
SNPs from chickpea ESTs generated earlier in the la- 
boratory and is described in Choudhary et al. 48 A set 
of 2 22 EST-derived primers were used to PCR 
amplify and resequence genomic DNA of the two par- 
ental lines (C. arietinum cv. ICC49 58 and C. reticula- 
tum cv. PI489 777) of the reference mapping 
population. 

2.3. Development of the GoldenGate SNP genotyping 
assays 

SNP genotyping was carried out using the lllumina 
GGGT For developing the GGGT assay, the identified 
SNPs along with the 60 bp sequence flanking it on 
either side were submitted to lllumina for processing 
by lllumina's Array Design Tool (ADT) in order to 
obtain a designability rank score for each SNP 
ranging from 0 to 1. This score serves as a reliability 
metric for testing whether or not a particular SNP 
will convert into a successful working GoldenGate 
assay where SNPs with the scores of 0.6 or higher 
possess a high probability of converting into a success- 
ful genotyping assay. Thus, SNPs having the highest 
ADT scores were selected for designing the custom 
Oligo Pool All (CpOPA) containing the allele-specific 
and locus-specific oligos for use in the lllumina 
GoldenGate assay and the first chickpea 768-OPA 
was generated and designated CpOPA-l. The list of 
SNPs and their flanking regions that constituted 
CpOPA-l are presented in Supplementary Table S1. 
Genotyping of SNPs was performed using lllumina's 
BeadArray Express Reader according to the standard 
manufacturer's protocol, 23,31 with 250 ng (50 ng/ 
ixl) of genomic DNA from each individual RIL of the 
mapping population. The automatic allele calling for 
each locus was inferred with the GenomeStudio 
Software V2011.1 (lllumina). The fluorescence 
images of array matrix carrying Cy3- and Cy5-labelled 
beads were generated with the two-channel scanner. 
A genotype that is homozygous for one SNP allele 
will display a signal in either the Cy3 or the Cy5 
channel, whereas a genotype that is heterozygous 
will display a signal in both channels. The intensity 
data were loaded in GenomeStudio, and cluster posi- 
tions were assigned to all genotypes at each SNP and 
these were checked manually for errors and rescored 
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while designating homozygous and heterozygous 
clusters. Further, the call rates and the GenTrain and 
GenCall score generated using the GenomeStudio 
software were analysed for SNP reliability. The 
GenTrain and GenCall50 score of >0.4 and the call 
rate of >95% were used as the lllumina recom- 
mended thresholds for declaring a reliable SNP. 



2.4. Construction of the genetic linkage map 

The inter-specific reference mapping population 
consisting of 1 29 RILs described above was used for 
genotyping and map generation. For linkage analysis, 
the GoldenGate genotyping data of the successful 
SNPs were utilized. In addition to this, the genotyping 
data of 368 polymorphic markers (52 EST-SSRs, 51 
ITPs, 25 ESTPs, 2 MtESTs, 130 gSSRs and 108 STMS; 
Table 1) recently mapped by Choudhary et at. 48 
using the same reference mapping population were 
also combined with the SNP genotyping data for 
anchoring and integrating the previously mapped 
co-dominant markers. This complete marker data 
set was employed for linkage analysis using Joinmap 
ver. 4.0. 67 A x 2 test (P<0.05) was performed for 
the conformity to the expected Mendelian segrega- 
tion ratio of 1:1. Markers showing segregation distor- 
tion were also integrated into the map. Markers were 
ordered with the regression mapping algorithm and 
were classified into LGs using the grouping module 
at LOD thresholds of 8.0-10.0 at an increment of 
0.5. Linkage groups were determined at LOD 9.5 
with a recombination frequency smaller than 0.49 
and a maximum threshold value of 5 for the jump. 
The best marker order was determined using the 
'Ripple' function (value of 1 ). Recombination frequen- 
cies were converted to map distances in centiMorgans 
(cM) using the Kosambi mapping function. 68 



Table 1. Markers utilized for construction of the inter-specific 
linkage map of chickpea (C. arietinum ICC4958 x C. reticulatum 
PI489777) 





Markers 
analysed 


Polymorphic markers 
used for mapping (%) 


Markers 
mapped 


SNPs (from 
this study) 


768 


697 (90.75) 


696 


EST-SSRs 48 


1 85 


52 (28.1) 


51 


ITPs 48 


1 51 


51 (33.8) 


51 


ESTPs 48 


1 09 


25 (22.9) 


25 


MtESTs 90 


1 5 


2 (1 3.3) 


2 


gSSRs 48 


310 


1 30 (41.9) 


1 30 


STMS 

markers 48,58 


1 08 


1 08 


108 


Total 


1 646 


1 065 (64.7) 


1 063 
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2.5. Synteny 

Synteny analysis of chickpea was carried out using 
genomic (gSSRs and SNPs) and genie markers (EST- 
SSRs, ITPs and ESTPs) from the current chickpea map. 
Marker sequences were aligned against the chromo- 
some-based assembly of Medicago HapMap Mt 3.5 
database [Medicago truncatula HapMap Project 2010 
(http://www.medicagohapmap.org/downloads.php)] 
as well as with Glycine max, JGI Glymal [Soybean 
Genome Project (http://www.phytozome.net/ 
soybean)] using local Blastn with an E-value of 
<1e-05. Synteny was visualized using MapChart 2.2 
(Voorrips 2002). All the Blastn individual hits corre- 
sponding to markers from the chickpea LGs (CaLG1- 
8) were parsed into the Microsoft Excel and the 
whole genome dot-blots were developed by using the 
physical location (Mb) of M. truncatula and G. max 
sequences and the genetic location (cM) of chickpea 
sequences. 

3. Results 

3.1 . SNP identification 

SNPs were identified from the whole genome se- 
quencing of the two parents of the chickpea reference 
mapping population namely C. arietinum ICC4958 
and its wild progenitor C. reticulatum PI489777. 
Nuclear genomes of the parents were sequenced to 
various depths using the NGS platforms, i.e. the Roche 
454 GSFLX Titanium and the Applied Biosystem's 
SOLiD platforms and compared for SNP identification. 
WGS libraries (three) of C. arietinum cv. ICC4958 
nuclear DNAwere used for 1 5 runs generating a total 
of 20.50 million filtered reads with 7.674 Gb high- 
quality (Phred score 20) bases. All the filtered reads 
generated by the 454 GS FLX platform were assembled 
by the de novo assembly tool Newbler (GS de novo as- 
sembler, Roche Applied Sciences) version 2.3 to 
obtain the primary assembly. The assembly parameters 
used were seed step 1 2, minimum overlap 40, 
minimum overlap identity 95% and seed length 16, 
with default quality filter, adapter and primer trimming 
of the input reads. The assembly yielded a total length 
of 350 724 940 bases in 160 883 large contigs 
(>500 bases) of average size 21 80 bp and the N50 
value of 3541 bp with 98.66% of the bases having 
quality of Q40 or more. Of these, 1000 largest 
contigs ranging in size from 36 to 1 3 kb were chosen 
as the reference sequence for drawing out the single- 
nucleotide changes. Further, 3 kb paired end libraries 
of both the parents (C. arietinum ICC4958 and C. reti- 
culatum PI489777) were sequenced using the SOLiD 
(Applied Biosystems) platform to generate 2 8 Gb 
(37.8X) and 1 2 Gb (1 6.2X) high-quality paired short 
reads (50 bases), respectively. SNPs in C. reticulatum 
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PI489777 were identified using the SOLiD reads. First, 
the SOLiD reads of ICC4958 were aligned on 
the previously described ICC4958 reference 
sequence by the Bioscope module of the tool HAPS 
(http://solidsoftwaretools.com/gf/project/haps/). The 
mapping output file of only the properly paired reads 
was then utilized by the SST scaffolding module of 
HAPS to produce a SOLiD read-based consensus 
sequence of ICC4958. Next, the SOLiD reads of 
C. reticulatum PI489777 were mapped onto the con- 
sensus sequence for SNP identification. Stringent cri- 
teria were used for SNP calling and only those SNPs 
were selected that had identical SNPs of 5X coverage 
and 0.6 confidence score and whose flanking regions 
were suitable for designing allele-specific and locus- 
specific primers for the lllumina GoldenGate Assay. 
These included only those SNPs which did not have 
other SNPs or indels in the 60-bp regions flanking it. 
Hence, a set of 920 candidate SNP loci were selected 
from the same numberof contigs and were designated 
as CaSNPs. Additionally, by amplicon resequencing of 
222 chickpea EST loci from C. arietinum and C. reticula- 
tum, 1 02 EST-derived SNPs (designated as ESNPs) were 
identified for genotyping as described in Choudhary 
et al. 48 Thus, from both the approaches, a total of 
1 022 SNPs (920 CaSNPs + 1 02 ESNPs) were identified 
for genotyping using the GoldenGate assay. All the 
1 022 SNPs and their flanking sequences are available 
in Supplementary Table S1 . Moreover, the 1 022 iden- 
tified SNPs were classified as transitions (C/T and G/ 
A) or transversions (C/G, T/A, A/C and G/T) based on 
nucleotide substitutions. In this analysis, higher rates 
of transition (almost double) in comparison with the 
rate of transversions were found, since 650 (63.65%) 
transitions were found in comparison with 372 
(36.4%) transversions. Among the transitions, the 
[C/T] type was slightly more prevalent (334, 51.4%) 
than [G/A] that accounted for 31 6 (48.6%) of single 
base changes, whereas among transversions, T/A 
scored relatively higher (123) in comparison with 
other transversions. 



3.2. SNP genotyping 

The 1 022 SNPs were submitted to lllumina for the 
Oligo Pool All (OPA) design for use in the GoldenGate 
assay. Each of the individual SNPs was assigned an ADT 
designability score by lllumina, ranging from 0 to 1, 
where a score <0.4 predicted a low success rate, 0.4 
to 0.6 a moderate success rate and >0.6 a high 
success rate for the conversion of an SNP into a suc- 
cessful GoldenGate assay. In all, 944 SNPs (851 
CaSNPs and 93 ESNPs) were assigned designability 
scores >0.6, thereby resulting in the possibility of 
conversion of 92.37% of the predicted SNPs into suc- 
cessful GoldenGate assays. However, for convenience 



of customizing the array design, the 768 SNPs with 
the highest scores (>0.76) were selected for inclusion 
in the first chickpea 768-OPA, named as CpOPA-l 
(Supplementary Table 1). This CpOPA-l assay was uti- 
lized for genotyping of the 129 RIL individuals along 
with the parental lines of the reference mapping 
population. 

The genotyping data of the 768 SNPs across the 
129 RILs were analysed using the GenomeStudio soft- 
ware (lllumina), which clusters and calls the data 
automatically, allowing visualization of the data dir- 
ectly for downstream analysis. For each SNP, the geno- 
typing data representation included three main 
clusters corresponding to AA homozygote, AB hetero- 
zygote and BB homozygote. As in the present study, an 
F9-F1 0 RIL mapping population was used, which is 
expected to contribute very few heterozygotes, most 
of the SNP markers produced two main clusters repre- 
senting the two homozygous genotypes, with some- 
times a small additional cluster corresponding to 
heterozygous genotypes (Fig. 1A). A few data points 
were sometimes ambiguously located outside these 
clusters (indicated by arrows in Fig. 1A) and repre- 
sented those for which no calls were generated and 
were therefore scored as missing data. In our genotyp- 
ing data set, the average level of heterozygosity was 
6.5%, expected for a RIL population, whereas the 
level of missing data per marker averaged at 5.6%. 

In the present study, of the 768 assays, 724 SNPs gen- 
erated clearly defined, well-separated clusters (as in 
Fig. 1A), whereas the remaining 44 SNPs did not 
produce clear clustering patterns and were excluded 
from the linkage analysis. These consisted of 28 false 
or monomorphic SNPs (failed to detect an SNP in the 
parents and the mapping population) that grouped 
into a single cluster (Fig. 1 B), and 1 6 SNPs that were 
classified as technically unsuccessful, since they had 
the GenTrain and GenCall50 score of <0.40, 'no-call' 
frequencies >5% and were represented by insufficient 
allele cluster separation (Fig. 1C). Further, of the 724 
SNP assays that showed clear clustering, 2 7 were 
found to be heterozygous in at least one of the 
parentsand hence were excluded from linkage analysis. 
Therefore in all, 697 SNPs (645 CaSNPs and 52 ESNPs) 
out of 768 SNPs were found to be successful, demon- 
strating a success rate of 90.7 5% for the lllumina 
GGGT in chickpea. 

3.3. SNP mapping 

For construction of a dense genetic map of chick- 
pea, the genotyping data of 697 polymorphic SNPs 
obtained from 1 2 9 RILs of the reference inter-specific 
mapping population (C. arietinum ICC4958 x C. reti- 
culatum PI489777) were utilized. Further, to gener- 
ate an integrated and advanced genetic map of 
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Figure 1. Representative clustering patterns generated by the lllumina GoIdenGate SNP Genotyping assay. For a given SNP marker, 
genotypes are called for each sample (dots) by their normalized signal intensity (Norm R, y-axis), i.e. sum of intensities of two 
fluorescent signals, and allele frequency (Norm theta, x-axis) relative to a cluster position (shaded area). The data point colour codes 
represent: red, AA (homozygous); blue, BB (homozygous); purple, AB (heterozygous); black, no call (missing data). (A) High-quality 
SNPs (e.g. CaSNP116 and CaSNP290) showing well-separated clusters of homozygous alleles (red and blue) and heterozygotes 
(purple). Some data points located between or in the border of these clusters (marked by an arrow) are unsuccessfully genotyped 
samples for which no calls were generated and considered as missing data. (B) SNPs which were considered as false or 
monomorphic (failed to detect an SNP in the parents and the mapping population) that grouped into a single cluster (e.g. 
CaSNP91 7 and ESNP46). (C) Technically unsatisfactory SNPs (e.g. CaSNP61 and CaSNP226) represented by insufficient allele cluster 
separation. 
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chickpea, the genotyping data of the 697 SNPs were 
compiled along with the genotyping data of 368 co- 
dominant markers published previously by our 
group. 48 These markers included 238 gSSRs 
(genomic SSRs), 52 EST-SSRs, 51 ITPs (Intron 
Targeted Polymorphism), 2 5 ESTPs and 2 MtESTs 
(Table 1 ), which served as a framework for anchoring 
the SNP loci. Hence, data of 1 065 markers (697 SNPs 
and 368 previously mapped co-dominant markers) 
were compiled together for linkage analysis and 
map generation using JoinMap ver. 4.0. 67 The result- 
ing integrated linkage map of chickpea defined map 
positions of 1063 markers distributed over 8 
linkage groups (Fig. 2). The current map spanned 
1 808.7 cM with an average inter-marker distance of 
1.7 cM. The map contained an average of 1.43 map 
positions per Mb of genome (1063 map positions/ 
740 Mb) and had one marker per 696 kb 
(740 Mb/1 063) when considering the chickpea 
genome to be 740 Mb. 44 The LGs were numbered 
(1 to 8) based on the common marker positions 
shared between corresponding LGs from previous 
studies. 48,58 The genetic length of the LGs ranged 
from 186.8cM (LG2) to 269.5 cM (LG1) (Table 2). 
LG3 was the most saturated, having 205 markers 
with an average marker density of 0.95 cM, whereas 
LG8 had the least number of markers (only 84) 
(Table 2). On an average, one linkage group con- 
tained 132.9 markers that spanned an average of 
226 cM. All categories of markers including genie, 
genomic and SNPs were found on each of the LGs. 
However, the number of SNP markers in each of the 
LGs was maximum and varied in the range 65- 
75% except in LG7, which had only 38% SNP 
markers. The markers were unevenly and non-ran- 
domly distributed in the LGs and 2 5 clusters of 
markers (>5 markers within 1 cM distance) were 
identified of which 2 0 clusters comprise only SNP 
markers, whereas a single cluster on LG3 contained 
only STMS markers. Moreover, nine gaps ranging 
from 20 to 35 cM were observed in four LGs (LG1, 
LG5, LG7 and LG8) located near the distal ends. 
The x 2 test performed resulted in segregation distor- 
tion for 42% (448) of the marker loci. However, 446 
of these markers were finally integrated into the map 
so that the loss of genetic information related to 
these markers was minimized. Moreover, the dis- 
torted markers were found to be widely distributed 
throughout all the LGs even though the ratios 
varied from one LG to another. For example in LG1 
and LG4, <25% markers showed distortion, 
whereas in others >30% of the mapped loci were 
distorted. The overall segregation distortion of 42% 
observed in this study was similar to the 41.3 and 
38% reported earlier for the same mapping 
population. 48,58 



3.4. Synteny between chickpea and other legumes 

To identify the syntenic relationships between chick- 
pea and other legume genomes, namely M. truncatula 
and G. max, the sequence information available for 
917 of 1063 mapped loci on the current linkage 
map of chickpea was utilized (Table 3). These 
markers included 696 SNPs, 51 EST-SSRs, 94 gSSRs, 
51 ITPs and 25 ESTPs for which locus sequences were 
available. The details about the markers on each of 
the chickpea linkage groups (CaLGs) that showed hits 
with chromosomes of G. max and M. truncatula are 
described in Table 3. 

Chickpea-Medicago synteny. The chickpea- Medicago 
synteny was observed for 31 5 (34.35%) of 917 
markers for which 404 significant matches were 
obtained on different chromosomes of Medicago. 
Synteny blocks were observed between chickpea 
and most Medicago chromosomes except Mt6 
(Supplementary Fig. S1 ), indicating an evolutionary re- 
lationship between the two species. Many of the CaLGs 
were found to have syntenic blocks with more than one 
LG of Medicago. CaLG3 showed maximum synteny, 
since 68 markers showed 86 hits with Medicago chro- 
mosomes (Table 3), majority of which (35 and 23) 
were located on Mt7 and Mt5, respectively (Fig. 3). 
Similarly, synteny blocks from CaLG2 were observed 
on Mt4, Mt5 and Mt6 (Fig. 3). Moreover, CaLG1, 
CaLG4 and CaLG5 showed distinct synteny with 
Mt2, Mt1 and Mt3 chromosomes, respectively 
(Supplementary Figs. S1 and S2). Additionally, CaLG6 
and CaLG7 were seen to be highly syntenic with Mt8, 
whereas CaLG8 was found to be syntenic with Mt4 
(Supplementary Figs. S1 and S2). The number of 
markers that found hits in Medicago in comparison 
with soybean was more; however, very limited collin- 
earity was observed between the chickpea- Medicago 
syntenic blocks. 

Chickpea-soybean synteny. In chickpea-soybean 
synteny analysis, 2 72 (30%) markers found a larger 
number of hits (609) in the soybean genome 
(Table 3) in comparison with Medicago. However, 
regions with macrosynteny, microsynteny and collin- 
earity were observed. All CaLGs showed syntenic 
blocks with Gm chromosomes except CaLG7 
(Supplementary Fig. S1). In the case of soybean also 
(as with Medicago), CaLG3 (54 markers) found 
maximum hits (129) in the soybean genome, of 
which majority were found on Gm19, Gm3 and 
Gm1 (Table 3; Supplementary Fig. S3). All CaLGs 
(except CaLG7) showed multiple synteny blocks with 
the Gm chromosomes (Supplementary Fig. S1). 
CaLG5 showed clear correspondence with regions in 
Gm4 and Gm6 in which the marker order was 
almost collinear. High collinearity was also observed 
between CaLG1 with Gm13, Gm1 5 and Gm9 
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;ure 2. Inter-specific linkage map of chickpea. The inter-specific linkage map of chickpea based on RILs of C. arietinum (ICC4958) x 
C. reticulatum (PI489777) was generated using JoinMap version 4.0. The name of the linkage groups is mentioned at the top of 
each LG. Distances of the loci (cM) are shown to the left and the names of loci are shown to the right side of the linkage groups. 
SNP markers are represented in black, while previously published markers are shown in colour: red, genie markers; green, genomic 
markers; blue, STMS markers of Winter et al. 5S used for anchoring. 
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Figure 2. Continued 
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Table 2. Distribution of markers on the eight linkage groups 



LGs 


Length (cM) 


Number of mapped 
markers 


Average marker 
density (cM) 


SNP markers 
(% of total) 


Markers showing 
distortion (%) 


Other markers 


1 


269.5 


119 


2.26 


89 (74.8) 


20.2 


30 


2 


1 86.8 


1 38 


1.35 


87 (63.0) 


56.5 


51 


3 


1 96.6 


205 


0.95 


1 34 (65.4) 


45.4 


71 


4 


200.4 


1 68 


1.19 


1 20 (71 .4) 


9.5 


48 


5 


259.3 


141 


1 .83 


96 (68.0) 


82.3 


45 


6 


220.5 


108 


2.04 


67 (62.0) 


50.9 


41 


7 


263.0 


100 


2.63 


38 (38.0) 


38.0 


62 


8 


21 2.6 


84 


2.53 


65 (77.4) 


30.9 


1 9 


Total 


1 808.7 


1 063 


1.70 


696 (65.48) 


42.0 
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Table 3. Details of CaLGs, number of markers and synteny with M. truncatula and C. max chromosomes 



Chickpea Markers Markers Mt chromosome (chickpea Gm chromosome (chickpea orthologs) 

LGs selected for orthologs) 

blast 



CaLG1 


119 


107 


Mt2(35) Mt4(3) Mt1 (2) Mt5(2) 
Mt7(2) Mt3(1 ) Mt6(1 ) Mt8(1 ) 


Gm1 5(1 9) Gm1 3(1 7) Gm9(4) Gm8(3) Gm1 7(3) Gm3(2) 
Gm1 0(2) Gm1 1 (2) Cm20(2) Gm1 (1) Gm4(1) Gm5(1) 
Gm6(1) Gm7(1) Gm14(1) Gm1 9(1) 


CaLG2 


1 38 


1 1 5 


Mt4(1 9) Mt5(9) Mt3(5) Mt6(4) 
Mt2(2) Mt7(2) Mt8(2) 


Gm1 2(1 0) Gm1 1 (7) Gm6(4) Gm9(4) Gm1 5(4) Gm1 8(4) 
Gm1 (3) Gm5(3) Gm1 3(3) Gm20(3) Gm4(2) Gm7(2) 
Gm1 0(2) Gm14(2) Gm1 9(2) Gm3(1) 


CaLG3 


205 


1 79 


Mt7(35) Mt5(23) Mt3(7) 
Mt2(6) Mt6(6) Mt8(4) Mt4(2) 
Mt1 (3) 


Gm1 9(29) Gm3(26) Gm1 (1 2) Gm9(8) Gm1 0(7) Gm6(2) 
Gm7(6) Gm1 1 (6) Gm1 3(5) Gm2(4) Gm5(4) Gm1 8(4) 
Gm20(4) Gm4(3) Gm6(2) Gm8(2) Gm1 2(2) Gm1 5(2) 
Gm1 6(2) Gm14(1) Gm1 7(1) 


CaLG4 


1 68 


1 52 


Mt1 (3 1 ) Mt5(5) Mt3(3) Mt4(3) 
Mt7(2) Mt6(1) 


Gm1 0(1 8) Gm20(1 1) Gm1 7(9) Gm1 (7) Gm1 4(7) Gm8(5) 
Gm1 1 (5) Gm1 2(5) Gm3(4) Cm9(4) Gm1 8(4) Gm5(3) 
Gm6(2) Gm7(2) Gm1 3(2) Gm1 5(2) Gm2(1) Gm4(1) 
Gm19(1) 


CaLG5 


141 


1 1 9 


Mt3(34) Mt1 (5) Mt7(5) Mt2(4) 
Mt5(4) Mt4(2) Mt6(1 ) Mt8(1 ) 


Gm6(1 9) Gm4(1 8) Gm20(7) Gm1 4(6) Gm1 8(6) Gm1 3(5) 
Gm1 1 (3)Gm1 (2) Gm2(2) Gm8(2) Gm1 7(2)Gm3(1) 
Gm5(1)Gm7(1)Gm19(1) 


CaLG6 


1 08 


86 


Mt8(1 5) Mt5(7) Mt7(5) Mt2(4) 
Mt3(4) Mt4(3) Mt6(3) Mt1 (2) 


Gm8(1 1) Gm5(9) Gm1 3(4) Gm1 8(4) Gm20(4) Gm6(3) 
Gm7(3) Gm1 1 (3) Gm1 (1 ) Gm2(1 ) Gm3(1 ) Gm1 0(1) 
Gm1 2(1) Gm14(1) Gm1 6(1) Gm1 7(1) Gm1 9(1) 


CaLG7 


100 


81 


Mt8(1 3) Mt3(6) Mt2(5) Mt5(4) 
Mt7(4) Mt1 (3) Mt4(3) Mt6(1 ) 


Gm7(6) Gm1 4(6) Gm7(6) Gm1 4(6) Gm5(4) Gm6(4) 
Gm1 6(4) Gm1 7(4) Cm1 (3) Cm2(3) Cm8(3) Gm1 1 (3) 
Gm1 3(3) Gm1 8(3) Gm4(2) Cm9(2) Gm1 9(2) Gm1 2(1) 
Gm1 5(1) Gm20(1) 


CaLG8 


84 


78 


Mt4(22) Mt2(4) Mt6(4) Mt1 (3) 
Mt3(3) Mt7(3) Mt8(3) Mt5(2) 


Gm1 7(20) Gm7(8) Gm1 3(8) Gm5(6) Gm6(6) Gm4(5) 
Gm9(4) Gm1 5(4) Gm1 0(3) Gm1 4(3) Gm1 8(3) Gm1 9(3) 
Gm3(2) Gm1 1 (2) Gm1 2(2) Gm1 6(2) Gm20(2) Gm8(1) 


Total 


1 063 


91 7 


404 


609 



(Fig. 3), followed by LG4 with Gml 7. Similarly, collin- 
ear regions were also identified between CaLG6 and 
Gm5 and Gm8 (Supplementary Fig. S1). 



4. Discussion 

The present study of the chickpea genome reports 
the large-scale identification of SNPs from the NGS of 



two contrasting genotypes, which were parents of an 
RIL mapping population. The development and valid- 
ation of the high-throughput GoldenGate SNP geno- 
typing assay was demonstrated for the first time in 
chickpea with a very high success rate of 90.75% 
(since 697 of 768 SNPs were genotyped). Further, 
these SNPs were assigned map locations in the back- 
drop of other co-dominant markers to generate one 
of the most dense genetic linkage maps of chickpea, 
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Figure 3. Syntenic relationships of chickpea LGs: CaLG2 with Mt4, Mt5 and Mt6; CaLG3 with Mt5 and Mt7; CaLG1 with Gm9, Gm1 3 and 
Gm1 5; CaLG3 with Gm1, Gm3 and Gm19. Coloured regions represent syntenic blocks observed in chromosomes of Medicago and 
soybean. 
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thereby clearly establishing the quality of the data and 
the utility of this approach. This study served to expand 
the repertoire of genomic resources in chickpea by pro- 
viding the first set of 696 validated and mapped SNP 
markers, which are the most preferred next generation 
markers for molecular breeding, association studies 
and comparative genomics. Recently, there has been 
a spurt in the large-scale discovery of SNPs in plants, 
especially after the public availability of EST databases 
and more recently with the continuously expanding 
NGS of genomes and transcriptomes. SNP identifica- 
tion in plants range from a few thousand such as 
1114 in wheat,' 5 2700 in Ambidopsis, 69 5000 in 
tomato, 40 6626 in pine,' 6 10 000 in cowpea, 13 
9194-36 000 in maize,' 7 ' 42 22 000 in barley, 32 25 
000-39 022 in soybean,' 8,70 40 000 in Medicago :9 
and 67 051 in rice, 71 to millions in Ambidopsis (2.5 
million), 72 rice (2.4-6.5 million), 7,73 Medicago (3 
million) 38 and soybean (6.5 million). 74 The thousands 
of SNPs generated are being extensively used for 
various applications such as the analysis of genetic 
diversity, 22,74 map construction' 3,1 5,32,40 and 
orienting the scaffolds in whole genome sequencing 
projects.' 8,43 

SNP discovery from de novo sequencing of genomes is 
a challenge especially in the absence of reference 
genomes. In the case of chickpea, since no reference 
genome was available, in the present study, SNPs were 
deduced by sequence comparisons between chickpea 
and its wild relative, C. reticulatum, with the aim of pro- 
ducing a highly useful set of molecular markers, which 
would be directly used for constructing a dense linkage 
map of chickpea. Consensus calling was done by de 
novo assembly of the Roche/454 long reads of C. arieti- 
num ICC4958 to generate the reference sequence, 
onto which the ICC4958 SOLiD short reads were 
mapped to generate a consensus sequence. Next, SNP 
calling was done by mapping the SOLiD reads of the 
contrasting genotype, C. reticulatum PI4897 77, onto 
the consensus sequence. Moreover, in order to 
custom design a genotyping array consisting of SNPs 
well-distributed throughout the genome, a predeter- 
mined number of SNPs were selected from the largest 
sized contigs. This array proved useful for establishing 
the efficacy of the SNP genotyping strategy for high- 
resolution mapping, which is mandatory for MAS, asso- 
ciation analysis and anchoring the chickpea genome. 

Many factors may be responsible for the overall 
success of SNP genotyping. Firstly, SNP prediction 
from NGS data should be based on stringent para- 
meters. When calling variants (especially from short 
reads), the quantity of reads and the accuracy of 
mapping the reads should be precise. This was 
ensured in our data since we used stringent criteria 
for SNP calling such as base quality >20, minimum 
read depth of 5X in both genotypes and frequency 



score >0.6. Secondly, the nucleotides flanking the 
query SNP also influence the rate of success of SNP de- 
velopment.' 6,23 In our study also, care was taken to 
select only those SNPs that had no variations in the 
60 bases flanking it. Moreover, the lllumina ADT 
which provides designability scores, checks for the 
presence of repetitive and palindromic sequences, 
GC content and sequence polymorphisms in the 
region flanking the query SNP. High designability 
scores were obtained for the 920 predicted chickpea 
SNPs of which 851 (92.5%) had an ADT designability 
score >0.6 indicating a high success rate for the con- 
version of an NGS-derived SNP into a GGGT assay. 
Even after combining with the 1 02 ESNPs, designabi- 
lity scores >0.6 were obtained for 92.37% of the pre- 
dicted SNPs. From these, since only 768 SNPs had to 
be selected for the OPA design, those having scores 
>0.76 (much higher than the threshold values) 
were selected for the 768-CpOPA-l. Inclusion of 
SNPs with such high designability scores accounted 
for the overall success of the genotyping assay as has 
also been demonstrated in pine.' 6 Moreover, 48 
CaSNPs predicted from the NGS data were experimen- 
tally validated by resequencing the alleles of the two 
parental varieties and a success rate of 87.5% was 
obtained (data not shown). 

In the present study, we also demonstrated that the 
high-throughput GoldenGate technology could be 
successfully used for genotyping predicted SNPs by 
conversion into working assays. This high-throughput 
platform of lllumina, which ensures a high level of 
multiplexing and reliability, has been suitably used 
in plants, 12 '' 3,1 6,32,33,75,76 where large numbers of 
SNPs need to be surveyed. In the present chickpea 
OPA, the success rate of 90.75% was obtained, 
which was comparable with the success rates of 
89-92% reported in the previous studies in other 
crops such as barley, 75 soybean, 31 cowpea,' 3 
maize 33 and eucalyptus, 20 and was higher than 
those reported in spruce 12 and pine. 16,34 The SNP 
success rate may be assessed in various ways and dif- 
ferent SNP reliability thresholds may be used. The SNP 
success rate of 90.7% reported for chickpea was based 
on the percentage of successfully genotyped SNPs. 
However, if the criteria of the call rate (>9 5%) were 
used, the success rate of 99.8% was achieved. The 
success rate was 93.0% when considering the 
GenTrain and GenCall50 score of >0.4 as thresholds, 
as described in Grattapaglia et al. 20 

We present here the first SNP-based linkage map of 
chickpea constructed using 697 SNPs in combination 
with 368 previously mapped co-dominant markers 
including a large number of SSRs (Fig. 2). In the 
past few years, SSR markers have been the most pre- 
ferred markers for constructing linkage maps due 
to their hypervariability, co-dominant nature and 
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reproducibility, and SSR-based maps have been con- 
structed in many plants such as wheat, 77 barley, 78 
tobacco, 79 tomato 80 and Catharanthus 8 ^ and 
legumes like chickpea 46,48 and soybean. 82 However, 
SSRs were found to have limitations, since they were 
not sufficiently dense to provide the genomic coverage 
necessary for the dissection of complex traits. 
Moreover, automation in high-throughput genotyping 
of SSRs was not very common. Therefore, SNPs are now 
proposed as the most useful markers, especially for the 
construction of highly saturated maps, which may have 
very wide and novel applications in plant genomics. 
Hence, modern sequencing technologies are being 
used to generate thousands to millions of SNPs in 
many plant species as mentioned previously. 
However, when it comes to the validation of SNPs and 
their use in genotyping and map generation, there 
are only a few examples of SNP-based high-density 
maps. The mapof chickpea generated in this study con- 
tained 697 SNPs— much higher than the 357 in pine, 1 6 
480 in wheat 1 5 and 558 in grass, 41 but lower than the 
793 in tomato, 40 92 8 incowpea, 1 3 1 397 in soybean 31 
and 2943 in barley. 32 In addition to the SNPs, 368 pre- 
viously mapped co-dominant markers were positioned 
in this chickpea map thereby generating one of the 
most advanced integrated maps of chickpea compris- 
ing of 1 063 markers with an average marker density 
of 1 .7 cM. This map was more saturated in comparison 
to other published maps of chickpea such as those of 
Winter et al 58 (2077.9 cM: 303 markers: AMD of 
6.8 cM), Nayak et al 45 (2602.1 cM: 521 markers: 
AMD of 4.99 cM), Gujaria et al 47 (766.56 cM: 300 
markers: AMD of 2.552 cM) and Choudhary et al. 48 
(1 497.7 cM; 406 markers; AMD of 3.68). A distin- 
guishing feature of this mapwasthat it was constructed 
based on a single reference population (not on integra- 
tion of maps from different populations), utilizing 
actual segregation data of each of the genomic and 
genie markers listed in Table 1. Consensus maps are 
generated in order to increase the number of 
mapped markers, but may compromise on the accur- 
acy of the markerorder. Hence our map may be consid- 
ered as more accurate regarding the marker order on 
the eight linkage groups. This was confirmed as consid- 
erable conservation of marker distribution existed 
between the LGs of the current map and our previous 
map 48 as well as the anchor map of Winter et at. 58 
Moreover, this map was a major advancement over pre- 
vious maps based on co-dominant markers, 45,47,48 as 
the mapped markers were more than doubled in the 
current map. Additionally, nine genie markers (four 
EST-SSRs, two ITPs and three ESTPs), which remained 
unmapped in our previous inter-specific map, were 
also mapped onto LG2, LG3, LG5, LG6, LG7 and LG8 
in the current map. The average marker density in 



each of the LGs in the current map was also found to 
be increased (0.95-2.65) in comparison with the 
density reported earlier (1 .77-8.01 ). 48 Therefore, it 
was envisaged that this highly advanced map would 
not only be useful in chickpea breeding but would par- 
ticularly aid and complement the efforts of orienting 
scaffolds of chickpea whole genome sequence being 
generated during its denovo sequencingatthe institute. 
High-resolution genetic maps are critical for the as- 
sembly of sequence scaffolds into pseudomolecules 
corresponding to chromosomes of organisms and 
have been used for anchoring and orienting scaffolds 
in the soybean, 1 8 watermelon, 43 grape, 83 cucumber 84 
and apple 85 whole genome sequencing. 

Genomic synteny can facilitate the transfer of 
genetic information especially between orphan and 
model crops. In legumes, map-based synteny has 
been shown in several studies such as cowpea, 13 
Medicago, 86 peanut, 87 common bean 88 and aspara- 
gus bean. 89 However, due to the lack of genomic 
resources, information on genome organization and 
comparative analysis was limited in chickpea. In this 
study, comparative analysis of chickpea using genie 
and genomic marker sequences with the model 
legume Medicago and soybean was done, which 
revealed higher levels of synteny with Medicago in 
comparison with soybean. This result indicated that 
the degree of synteny declines with increasing phylo- 
genetic distance, as chickpea and Medicago both 
belong to the Galegoid clade, whereas soybean 
belongs to the Phaseoloid clade. Chromosomal rear- 
rangements seemed to have occurred since most of 
the chickpea LGs were chimeric to two or more Gm 
chromosomal segments. Moreover, some segments 
of chickpea such as those on CaLG3 were found to 
have blocks on two Gm chromosomes (Gm3 and 
Gm19), thereby indicating the phenomenon of 
genome duplication. 

In conclusion, the present study illustrates the appli- 
cation of NGS technology for the large-scale identifi- 
cation of SNPs from species without reference 
genomes such as chickpea. It also demonstrates that 
the high-throughput genotyping technologies such 
as the lllumina GoldenGate assay can be successfully 
used for genotyping and subsequently mapping the 
SNPs and integrating them with other co-dominant 
markers for the construction of a dense genetic map 
of chickpea. The availability of such large numbers 
of SNP markers, advanced genotyping technologies 
and the high-density marker map would serve as a 
foundation not only for orienting and anchoring of 
scaffolds arising out of whole genome sequencing 
projects but also for association mapping and 
synteny-based genomics in chickpea and other 
legumes. 
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